MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

Author: Chen Yiran
Cheng Hsin-Pai
Li Hai
Li Sicheng
Song Chang
Wu Chunpeng
Wu Qing
Yang Huanrui
Publication date: 11/05/2018

Some recent works revealed that deep neural networks (DNNs) are vulnerable to so-called adversarial attacks, where input examples are intentionally perturbed to fool DNNs. In this work, we revisit the DNN training process that includes adversarial examples in the training dataset so as to improve DNNs' resilience to adversarial attacks, namely, adversarial training. Our experiments show that different adversarial strengths, i.e., perturbation levels of adversarial examples, have different working zones to resist the attack. Based on this observation, we propose a multi-strength adversarial training method (MAT) that combines adversarial training examples with different adversarial strengths to defend against adversarial attacks. Two training structures, mixed MAT and parallel MAT, are developed to facilitate the tradeoffs between training time and memory occupation. Our results show that MAT can substantially minimize the accuracy degradation of deep learning systems under adversarial attacks on MNIST, CIFAR-10, CIFAR-100, and SVHN. Comment: 6 pages, 4 figures, 2 tables
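As a rough illustration of combining adversarial examples of several strengths into one training batch, here is a minimal sketch. It is not the authors' implementation: the attack used is the fast gradient sign method (FGSM), the model is a plain logistic regression so the input gradient can be written by hand, and the names `fgsm`, `mixed_batch`, and `strengths` are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm(x, y, w, b, eps):
    """FGSM on a logistic-regression model (an assumed stand-in for a DNN).

    Moves each input by eps along the sign of the cross-entropy gradient
    with respect to the input, then clips to the valid range [0, 1].
    """
    p = sigmoid(x @ w + b)
    grad_x = (p - y)[:, None] * w[None, :]  # d(loss)/dx for each example
    return np.clip(x + eps * np.sign(grad_x), 0.0, 1.0)

def mixed_batch(x, y, w, b, strengths):
    """Stack the clean batch with one adversarial copy per strength."""
    xs = [x] + [fgsm(x, y, w, b, eps) for eps in strengths]
    ys = [y] * (len(strengths) + 1)
    return np.concatenate(xs), np.concatenate(ys)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.random((4, 3))               # inputs in [0, 1]
    y = np.array([0.0, 1.0, 0.0, 1.0])   # binary labels
    w = np.array([1.0, -2.0, 0.5])       # assumed fixed model weights
    x_mix, y_mix = mixed_batch(x, y, w, 0.1, strengths=[0.05, 0.1])
    print(x_mix.shape, y_mix.shape)
```

The stacked batch would then be fed to an ordinary training step; how the paper's mixed and parallel structures actually schedule the strengths is not stated in the abstract.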

arXiv.org e-Print Archive

Crossref

Towards Efficient Hardware Acceleration of Deep Neural Networks on FPGA

Author: Li Sicheng
Publication date: 25/01/2018

Deep neural networks (DNNs) have achieved remarkable success in many applications because of their powerful capability for data processing. Their performance in computer vision has matched and in some areas even surpassed human capabilities. Deep neural networks can capture complex nonlinear features; however, this ability comes at the cost of high computational and memory requirements. State-of-the-art networks require billions of arithmetic operations and millions of parameters. The brute-force computing model of DNNs often requires extremely large hardware resources, raising severe concerns about its scalability on traditional von Neumann architectures. The well-known memory wall, and the latency brought by the long-range connectivity and communication of DNNs, severely constrain their computation efficiency. Acceleration techniques for DNNs, whether software or hardware, often suffer from poor hardware execution efficiency of the simplified model (software), or inevitable accuracy degradation and limited supportable algorithms (hardware). In order to preserve inference accuracy and make the hardware implementation more efficient, a close investigation of hardware/software co-design methodologies for DNNs is needed. The proposed work first presents an FPGA-based implementation framework for Recurrent Neural Network (RNN) acceleration. At the architectural level, we improve the parallelism of the RNN training scheme and reduce the computing resource requirement to enhance computation efficiency. The hardware implementation primarily targets reducing the data communication load. Secondly, we propose a data locality-aware sparse matrix and vector multiplication (SpMV) kernel. At the software level, we reorganize a large sparse matrix into many modest-sized blocks by adopting hypergraph-based partitioning and clustering. Available hardware constraints have been taken into consideration for memory allocation and data access regularization. Thirdly, we present a holistic acceleration of sparse convolutional neural networks (CNNs). During network training, the data locality is regularized to ease the hardware mapping. The distributed architecture enables high computation parallelism and data reuse. The proposed research results in a hardware/software co-design methodology for fast and accurate DNN acceleration, through innovations in algorithm optimization, hardware implementation, and the interactive design process across these two domains.
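To make the idea of splitting a sparse matrix into modest-sized blocks for SpMV concrete, here is a minimal sketch. It uses plain fixed-size tiling, not the hypergraph-based partitioning and clustering the abstract describes, and the helper names `to_blocks` and `blocked_spmv` and the block size `bs` are hypothetical.

```python
import numpy as np

def to_blocks(a, bs):
    """Cut matrix `a` into bs x bs tiles and keep only tiles with nonzeros.

    Returns a dict mapping the (row, col) offset of each tile to its values.
    Fixed tiling is an assumption made here for simplicity.
    """
    blocks = {}
    n_rows, n_cols = a.shape
    for i in range(0, n_rows, bs):
        for j in range(0, n_cols, bs):
            tile = a[i:i + bs, j:j + bs]
            if np.any(tile):
                blocks[(i, j)] = tile.copy()
    return blocks

def blocked_spmv(blocks, x, n_rows):
    """Compute y = A @ x one stored tile at a time.

    Each tile reads a short contiguous slice of x and updates a short
    contiguous slice of y, which is the locality benefit of blocking.
    """
    y = np.zeros(n_rows)
    for (i, j), tile in blocks.items():
        y[i:i + tile.shape[0]] += tile @ x[j:j + tile.shape[1]]
    return y

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = rng.random((10, 7)) * (rng.random((10, 7)) < 0.1)  # roughly 10% nonzeros
    x = rng.random(7)
    blocks = to_blocks(a, bs=4)
    print(len(blocks), np.allclose(blocked_spmv(blocks, x, 10), a @ x))
```

All-zero tiles are never stored or multiplied, so the work scales with the number of occupied tiles instead of the full matrix size.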

D-Scholarship@Pitt

A study of energy correction for the electron beam data in the BGO ECAL of the DAMPE

Author: Huang Guangshun
Li Zhiying
Wang Chi
Wang Xiaolian
Wei Yifeng
Wen Sicheng
Xu Zizong
Zhang Yunlong
Zhang Zhiyong
Publication date: 10/11/2015

The DArk Matter Particle Explorer (DAMPE) is an orbital experiment that aims to search for dark matter indirectly by measuring the spectra of photons, electrons and positrons originating from deep space. The BGO electromagnetic calorimeter is one of the key sub-detectors of DAMPE and is designed for high-energy measurement with a large dynamic range from 5 GeV to 10 TeV. In this paper, some methods for energy correction are discussed and tried, in order to reconstruct the primary energy of the incident electrons. Different methods are chosen for the appropriate energy ranges. The results of Geant4 simulation and beam test data (at CERN) are presented.

arXiv.org e-Print Archive

CERN Document Server

Multi-objective Location-Routing Model for Hazardous Material Logistics with Traffic Restriction Constraint in Inter-city Roads

Author: Hu Hao
Li Xiang
Shang Changjing
Zhang Sicheng
Zhang Yuanyuan
Publication date: 01/02/2019

Aberystwyth Research Portal

DORec: Decomposed Object Reconstruction Utilizing 2D Self-Supervised Features

Author: Ji Sihui
Li Sicheng
Liao Yiyi
Wang Yue
Wu Jun
Xiong Rong
Publication date: 19/10/2023

Decomposing a target object from a complex background while reconstructing it is challenging. Most approaches acquire the perception of object instances through the use of manual labels, but the annotation procedure is costly. Recent advancements in 2D self-supervised learning have brought new prospects to object-aware representation, yet it remains unclear how to leverage such noisy 2D features for clean decomposition. In this paper, we propose a Decomposed Object Reconstruction (DORec) network based on neural implicit representations. Our key idea is to transfer 2D self-supervised features into masks of two levels of granularity to supervise the decomposition, including a binary mask to indicate the foreground regions and a K-cluster mask to indicate the semantically similar regions. These two masks are complementary to each other and lead to robust decomposition. Experimental results show the superiority of DORec in segmenting and reconstructing the foreground object on various datasets.

arXiv.org e-Print Archive

Corporate Social Responsibility and Maturity Mismatch of Investment and Financing: Evidence from Polluting and Non-Polluting Companies

Author: Bao Xiaolan
Crabbe M. James C.
Li Sicheng
Luo Qiaosheng
Yue XiaoGuang
Publication venue: 'MDPI AG'
Publication date: 01/01/2020

We investigate the influence of corporate social responsibility (CSR) on the maturity mismatch of investment and financing from the perspective of both polluting and non-polluting companies. The results reveal that CSR performance can aggravate the maturity mismatch of investment and financing, and the effect can be more serious in polluting companies. At the same time, we find that CSR leads companies to obtain more short-term debt. What is more, polluting companies perform more environmental responsibilities in the form of long-term investments than non-polluting companies. These phenomena exacerbate the maturity mismatch of investment and financing, and this effect is only significant when polluting companies choose mandatory CSR disclosure. The impact of CSR on the maturity mismatch of investment and financing is more apparent in companies with lower value and at smaller scales. We show that companies should not only perform their CSR to maintain a balanced economic and ecological development, but also pay attention to the aggravation of the maturity mismatch of investment and financing. The paper was supported by the following fund project: General project of philosophy and social science research of the Ministry of Education (project no. 18JHQ083).

Multidisciplinary Digital Publishing Institute

Repositório Científico do Instituto Politécnico do Porto

Dexterous In-Hand Manipulation of Slender Cylindrical Objects through Deep Reinforcement Learning with Tactile Sensing

Author: Hu Wenbin
Huang Bidan
Lee Wang Wei
Li Zhibin
Yang Sicheng
Zheng Yu
Publication date: 11/04/2023

Continuous in-hand manipulation is an important physical interaction skill, where tactile sensing provides indispensable contact information to enable dexterous manipulation of small objects. This work proposed a framework for end-to-end policy learning with tactile feedback and sim-to-real transfer, which achieved fine in-hand manipulation that controls the pose of a thin cylindrical object, such as a long stick, to track various continuous trajectories through multiple contacts of three fingertips of a dexterous robot hand with tactile sensor arrays. We estimated the central contact position between the stick and each fingertip from the high-dimensional tactile information and showed that the learned policies achieved effective manipulation performance with the processed tactile feedback. The policies were trained with deep reinforcement learning in simulation and successfully transferred to real-world experiments, using coordinated model calibration and domain randomization. We evaluated the effectiveness of tactile information via comparative studies and validated the sim-to-real performance through real-world experiments. Comment: 10 pages, 12 figures, submitted to Transaction on Mechatronics

arXiv.org e-Print Archive